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Abstract: This note presents an elementary version of Sims's algorithm for computing 
strong generators of a given perm group, together with a proof of correctness and some 
notes about appropriate low-level data structures. Upper and lower bounds on the running 
time are also obtained. (Following a suggestion of Vaughan Pratt, we adopt the convention 
that perm = permutation, perhaps thereby saving millions of syllables in future research.) 

1. A data structure for perm groups. A "perm," for the purposes of this paper, is a one-to- 
one mapping of a set onto itself. If a and (3 are perms such that a takes i j and /? takes j i— > k, 
the product a/3 takes k. We write a~ for the inverse of the perm a; hence afi = 7 iff a = 7/?". 

Let n{k) be the set of all perms of the positive integers that fix all points > k. Consider the 
following data structure: For 1 < j < A;, either akj = or a^j is a perm of n{k) that takes k 1— > j. 
Let S{k) be the set of all non-0 perms Ukj- We assume that Ukk is the identity perm; hence S{k) 
is always nonempty. 

We write r{k) for the set of all perms that can be written as products of the form ui . . . Cfc 
where each ai is in There is an easy way to test if a given perm vr G n{k) is a member of 

r{k): Let vr take k j. Then if akj = we have vr r{k); otherwise if A; = 1 we have vr G r(k); 
otherwise vr G r{k) iff vra^^. G r{k — 1). 

The data structure also includes a set T(k) C n{k) with the invariant property that each 
element of r{k) can be written as a product of elements of T{k). In other words, r{k) will be a 
subset of the group {T[k)) generated by T{k), for all k, throughout the course of the algorithm to 
be described. (Since all elements vr of n{k) are finite perms, we have vr~ = vr*" for some r > 0; 
hence closure under multiplication implies closure under inversion.) 

The data structure is said to be up-to-date of order n if r{k) 5 T{k) and if r{k) is closed 
under multiplication, i.e., if r{k) = {T{k)), for 1 < k < n. In that case we say that the perms 
[J^^i ^{k) form a transversal system of r{n), and that the perms IJfc=i ^(^) strong generators 
of r(n). Having a transversal system makes it easy to determine what perms are generated by a 
given set of perms T{n). 

2. Maintaining the data structure. Let us now discuss two algorithms that can be used to 
transform the data structure when a new perm is introduced into T{k). We will first look at the 
algorithms, then discuss why they are valid. 
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Algorithm ^^(Tr). Assuming that the data structure is up-to-date of order k, and that tt G n{k) 
but TT ^ r{k), this procedure appends tt to T{k) and brings the data structure back up-to-date so 
that r{k) will equal the new (T(fe)). 

Step Al. Insert tt into the set T{k). 

Step A2. Perform algorithm Bk{(JT) for all a G ^{k) and r € such that ctt is not already 

known to be a member of r{k). (Algorithm may increase the size of S{k)\ any new 
perms a that are added to S{k) must also be included in this step. Implementation 
details are discussed in Section 3 below.) | 

Algorithm Bkiir). Assuming that the data structure is up-to-date of order k — 1, and that vr € 
{T(k)), this procedure ensures that tt is in r{k) and that the data structure remains up-to-date of 
order A; — 1. (The value of k will always be greater than 1.) 
Step Bl. Let tt take k i— > j. 

Step B2. If (jfcj = 0, set akj tt and terminate the algorithm. 

Step B3. If TTCT^^ G r{k — 1), terminate the algorithm. (This test for membership in r{k — 1) has 

been described in Section 1 above.) 
Step B4. Perform algorithm Ak-i{Tra'^j). | 

The correctness of these mutually recursive procedures follows readily from the stated invariant 
relations, except for one nontrivial fact: We must verify that r{k) is closed under multiplication 
at the conclusion of algorithm Akiir). This is obvious when A; = 1, so we may assume that k > 1. 
Let a and /3 be elements of r{k). By definition of r{k) we can write a = 'ya, where 7 G r{k — 1) 
and a G ^{k); and by the invariant relation r{k) C {T(k)) we can write (3 = ti . . . Tr where each 
Ti G T{k). We know that cjti G r{k), by step A2; hence ari = 711T1 for some 71 G r{k — 1) and 
some (Ji G S{k). Similarly aiT2 = 72C2, etc., and we finally obtain a/3 = 771 ... 7^0"^. This proves 
that a/3 G r{k), since 771 ... 7^ is in r{k — 1) by induction. 

3. Low-level implementation hints. Let s(k) be the cardinality of S{k) and t{k) the cardinality 
of T{k). The algorithms of Section 2 can perhaps be implemented most efficiently in practice by 
keeping a linear list of the perms r(A;, 1) . . . T(k,t{k)) of T{k), for each k, together with an array 
of pointers to the representations of each a^j for 1 < j < A, using a null pointer to represent the 
relation a^j = 0- It is also convenient to have a linear list j{k, 1) . . . j(^k,s{k)) of the indices of 
the non-0 perms akj, where j{k, 1) = k. We will see below that the algorithm often completes its 
task without needing to make many of the sets S(A;) very large; thus most of the akj are often 0. 
Pointers can be used to avoid duplications between T{k) and T{k — 1). 

There are two fairly simple ways to handle the loop over a and r in step A2; one is recursive and 
the other is iterative. The recursive method replaces step A2 by the following operation: "Perform 
algorithm Bj~{aTT) for all a in the current set I](fc)." Then step B2 is also changed: "If akj = 0, set 
(^kj TT and perform Bk{'KT) for all r in the current set T{k,), then terminate the algorithm." 

The iterative method maintains an additional table, in order to remember which pairs (cr, r) 
have already been tested in step A2. This table consists of counts c{k,i) for each k and for 
\ <i < s{k), such that the product akj{k,i)T{k,l) is known to be in r{k) for 1 < Z < c{k,i). When 
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step B2 increases the value of s{k), the newly created count c(^k,s{k)) is set to zero. Step A2 is a 
loop of the form 

while i < s{k) do 

begin while c{k,i) < t{k) do 
begin / ^ c{k, i) + 1; 
Bk{'ykj{k,i)T{k,l)); 
c{k, i) /; 
end; 

end; 

the invocation of may increase s{k), but it can change t{k') and c{k',i') only for values of k' 
that are less than k. 

The iterative method carries out its tests in a different order from the recursive method, so it 
might yield a different traversal system. 

It is convenient to represent each perm a of IJ{k) indirectly in an array q that gives inverse 
images, so that a takes q[i] i ioi 1 < i < k. All other perms tt can be represented directly 
in an array p, with tt taking i i— ^ p[i] iov 1 < i < k. To compute the direct representation d of 
the product Tra~ , wc can then simply set d[i] <— for 1 < i < k. To compute the direct 

representation d of the product air, we set c?[g[i]] p[i] ioi 1 < i < k. Thus, the elementary 
operations are fast. 

4. Upper bounds on the running time. The "inner loop" of the updating algorithms occurs 
in step B3, the membership test. Testing for membership of vr G r{k) involves multiplication by 
some sequence of non-identity perms o"^^ ^-^ , . . . ■ , where A; > /ci > • • • > fc^ > 0; so the running 
time is essentially proportional to k + ki + ■■■ + kr, which is 0{k'^) in the worst case. 

The total number of executions of Bk^ar) is s{k)t{k), and we have s{k) < k. The value of t{k) 
increases by 1 each time we perform Ak{Tr); every time we do this, we increase r{k) to a larger 
subgroup of n{k), hence t{k) cannot exceed the length of the longest chain of subgroups of the 
symmetric group n{k). A straightforward upper bound is therefore t{k) < 9{k\) = 0(A;loglogA;), 
where 9{N) is the number of prime divisors of counting multiplicity. Babai [1] has shown that 
n{k) admits no subgroup chains of length exceeding 2k — 3, when k > 2; hence we have the sharper 
estimate t{k) = 0{k). 

It follows that algorithm Bk{crT) is performed 0{k'^) times, and each occurrence of step B3 
takes 0{k'^) units of time. Summing for 1 < < n allows us to conclude that a transversal system 
for a perm group generated by m perms of n{n) can be found in at most O(n^) + 0{mn^) steps. 
(The term 0{mn'^) comes from m membership tests, which are carried out on each generator tt 
before algorithm ^„(7r) is applied.) 

The storage requirement for each non-identity perm of ^{k) or T{k) is 0{k); hence we need 
at most 0{k'^) memory cells for perms of n{k), and O(ra^) memory cells in all. 

5. A sparse example. Actual computations with these procedures rarely take as much time as 
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our worst-case estimates predict. We can learn more about the true efHciency by studying particular 
cases in detail. Let us therefore consider first the case of a group generated by a single non-identity 
perm vr G 77(n). 

We begin, of course, with akj = for I < j < k < n and T{k) = for I < k < n; 
the data structure is then up-to-date of order n, and we can perform An{n). Suppose vr takes 
n I— > ai I— ^ • • • I— > ttr-i 1-^ n. Then A„(7r) will set T(n) <— {vr} and anaj <— t^-' for 1 < j < r, and it 
will invoke A„_i(7r'') (unless tt'" is the identity perm, in which case the algorithm will terminate). 

If, for example, we have 

TT = [1, 2, 3, 4, 5, 6, 7, 14] [8, 9, 10, 13] [11, 12] 

in cycle form, the algorithm will set (Ti4,j <— vr-' for 1 < j < 8, and it will terminate with T(14) = {tt} 
and with all other T(k) empty. But if we relabel points 12 and 14, obtaining the conjugate perm 

7f = [1, 2, 3, 4, 5, 6, 7, 12] [8, 9, 10, 13] [11, 14] , 

the algorithm will act quite differently: The nontrivial perms akj and sets T{k) will now be 

0"l4,ll=7r, 0-13^9 = 7f^, Cri2,4 = Ti''' ; 

T(i4) = {7f}, r(i3) = {7f2}, r(i2) = {7fn. 

When the algorithm terminates, it has produced a transversal system by which wc can test if 
a given perm p is a power of tt or vf , respectively. In the first case this membership test involves at 
most one multiplication, by o"i4^j if p takes 14 j where j < 8. In the second case the test will 
involve three multiplications if wc have, say, p = vf ''. 

These perms vr and vf are the special case h = 4 of an infinite family of perms of degree 
n = 2^^ — 2, having cycles of lengths 2^~^, 2^*"^, . . . , 2^. In general tt will cause ~ slots akj to 
become nonempty, and it will terminate after performing ~ elementary machine steps, yielding 
a membership test whose worst-case running time is ~ n. The corresponding perm tt will cause 
only ~ Ign slots akj to become nonempty, and it will terminate after ~ 2nlgn steps, yielding a 
membership test whose worst-case running time is ~ nlgn. Thus, the algorithm's performance can 
change substantially when only two points of its input perm are relabeled. 

6. A dense example. The algorithm needs to work harder when we wish to find the group 
generated by {tt2, vts, . . . , 7r„}, where VTfc € n{k) takes k k — 1, and where the generators VTfc are 
input in increasing order of k. Then it is not difficult to verify by induction that the algorithm will 
terminate with T{k) = {7T2, ■ ■ ■ ,T^k} and with akj $ fov 1 < j < k < n. Thus, the algorithm will 
fill all of the slots akj, thereby implicitly deducing that each r{k) is the full symmetric group n{k). 

Moreover, if the recursive method of Section 3 is being used to implement step A2, the algo- 
rithm will terminate with 

o"fcj = TTfcTTfc-i . . . TTj+i , for 1 < j < A; < n. 
For after akj is defined, the modified step B2 will continue to test whether the perms 

<^kj'^2, O-kjTTa, akjTTk 
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belong to the current r{k). The first j — 2 tests will succeed; then Bk{akjTTj) will cause (Jkj-i to 
be defined. And by the time the recursive call on Bk{(TkjT^j) returns control to Bk{crkj), the values 
of (Tfci will be non-0 for all i < k: hence the remaining tests on cr^jTri for / > j will succeed. 

Let us examine the special case of this construction in which each tt^ is the simple transposition 
[k, k — 1]. How much time is taken by the Q{n^) membership tests (JkjTTi G r(k)7 We have 



o-fei = [j, J + 1, ■ ■ ■ , ^] , 

and it follows that 



o-fci , if i = i + 1; 

o-i-i,i-2cr/cj , ifi>i + l. 



Each membership test therefore involves at most two multiplications by non-identity perms, and 
the total running time of the algorithm is 0(n^). 

Another interesting special case occurs when each tt^ is the cyclic perm [k, k — 1, . . . , I]. Here 
we find that akj takes 

x-{k-j), ifx>k-j; 
k + 1 — X, if X < k — j. 



X 



It turns out that we have 



O'kjT^i 



Ck-j,i (Jk-j+i^k-j+i-i o'kj , if ? < j; 

o'k-i,i <^k-j,i o'k-j+i,k-i+i <7fc,i-i ) if 1 < j < i < fe; 

CTk-i,! crk-2,i-2 0-k-i,k-i+icrki , if j = 1 and 2 < i < k; 

ak2 ) if J = 1 and i = 2; 

Cfc-1,1 , if j = 1 and i = k; 

. o'k-j,! o-k-j+1,1 CTkj-i , if 1 < j < i = k. 



So the memberships tests need at most 4 multiplications each, and again the total running time is 
G(n4). 

In both of these special cases, it turns out that the iterative implementation of step A2 will 
also define the same perms akj- Hence the running time will be 0(n'^) under either of the imple- 
mentations we have discussed. 

It is interesting to analyze the algorithm in another special case, when there are just two 
generators (7„ = [1, 2, . . . , n] and t„ = [n — 1, . Assume that the recursive implementation is used. 
First, Algorithm y4„((T„) sets T{n) = and performs Bn((T„). Algorithm i5„((T„) sets a"„i <— a"„ 
and performs i?„(cr^), which sets (T„2 and performs i?„((7^), etc. Thus (7„j becomes a-j^ for 

all j. Second, Algorithm ^„(t„) adds t„ to T(n) and performs Bn{Tn), Bn{(JnTn), . . . , Bn{(Jll~^Tn)- 
The first of these subroutines, Bn{Tn), performs algorithm A„_i(r„<T„), which is yl„_i(c7„_i). The 
second subroutine, Bn{crnTn), performs A„_i((T„r„(T~^), which is A„_i(r„_i). Therefore we can 
use induction on n to show that akj = cr^ for all j and k. It is easy to verify that each membership 
test requires at most three nontrivial multiplications. Therefore the total running time in this 
special case comes to only B(n^), although r{n) is the full symmetric group n{n). 

7. A random example. The conditions of the construction in Section 6 allow (A; — 1)! possibilities 
for each perm tt^. Let us consider the average total running time of the algorithm when each of 
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the 1! 2! ... (n — 1)! choices of {7r2,7r3, . . . ,7r„} is equally likely. On intuitive grounds it appears 
plausible that the average running time will be 0(n^), because most of the multiplications in a 
"random" situation will be by non-identity perms. This indeed turns out to be true, at least when 
the recursive implementation of step A2 is used; but the proof is a bit delicate. 

As before, the running time is dominated by Q(n^) successful tests for membership of akjTTi in 
r{k), where k > j > I and k >i > 1 and i ^ j. We know that the total running time is O(n^), so 
we need only show that the average value is n{n^); and for this purpose it will suffice to consider 
only the membership tests with k > j > i. 

The membership test for akjiTi performs the multiplications 

and the cost is I for each multiplication such that ji ^ I. Since j > i, we always have jk = j- Let 
us fix the values k, j, i, and I, where k > j > i > 1 and k > I > i, and try to determine an upper 
bound for the probability that ji = I. The following analysis applies to any given (not necessarily 
random) sequence of perms tt/, . . . , 7r2, with tt^, . . . , 7r;+i varying randomly. 

Let i — r he the number of points < i that are fixed by the given perm tTj. By assumption, tTj 

takes i i — 1, hence r > 2. 

Our first goal is to determine the probability that we have jk-i = k—1, jk-2 = k—2, . . . , ji = I. 
This holds iff o'fcjTTjcr^^- G n{l — 1). Note that, in the recursive implementation of step A2, we have 

<^kjT^i<^kj = ^kT^k-l ■ ■ ■ T^j+lT^iT^J+i ■ ■ ■ T^k-l'^k = '^kPT^k ^ 

where /? is a perm of 11 (k — 2) that has the same cycle structure as vTi; hence p fixes exactly k — 2 — r 
points < k — 2. Consider what happens to vrfcpvr^ as VTfc runs through its (A;^ 1)! possible values: We 
obtain a uniform distribution over all perms of n{k — 1) having the same cycle structure as p. For 
example, if r = 7 and p = [1 2 7] [3 6] [4 9], the (k — 1)! perms TTfepvT^ are just [ai 02 ay] [03 a6][a4 og] 
as ai . . . Ofc-i runs through the images of all perms of n{k — 1). Therefore the probability that 
(^kjT^iC^kj ^ - 1) is 

// _ A / /A: - A ^ (/-l)(/-2)...a-r) 
V ^ // V r J (fc- l)(fe-2)...(fe-r) ■ 

Now let's compute the probability that jk-i = k — 1, ... , jg+i = q + 1, jg < q, and ji = given 
a subscript q in the range k > q > I. We will assume that nk-i, . ■ ■ , tTq+i, 7rg_i, . . . , ^2 have been 
assigned some fixed values, while iTk and TTg run independently through all of their {k — 1)! (q — 1)1 
possibilities. Under these circumstances we will prove that o'kj'^i'^kj'^qjq uniformly distributed 
over n{q — 1). 

Let p be a positive integer less than q. Let a G n{q) take q i-^ p and have the same cycle 
structure as TTj. Also let /? be an element of iT(g — 1). Then there is exactly one perm tt^ that will 
make aa~p = /3, namely 

TTq = ^"a7rp^_i...7rg"_i. 
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(This perm takes q i-^ q — 1 and fixes all points > q, so it meets the conditions necessary to be 
called TTq.) Moreover, when iiq has this value, the number of perms nk such that crkjTTiaf^j = a is 
independent of a, as wc have observed in the previous case. Therefore the probability that {jq = p 
and crkjTTiCr'j^ja~p = (3) is independent of /3, and independent of p. 

The uniform distribution of CTkjT^iC^kj^qv i^pli^s that we have {jk-i = A: — 1, . . . , jg+i = g + 1, 
jq < q, and ji = I) with probability l/l times the probability that {jk-i = k — 1, . . . , jg+i = q + 1, 
and jq < q), because the values jg-i ...j-z are uniformly distributed. And we know from the 
previous analysis that this probability is 

1 (q{q-l)...{q-r + l)-{q-l){q-2)...{q-r)\ r {q - I) . . . {q - r + I) 



l\ {k-\){k-2)...{k-r) J I {k - \) . . .{k - r) 

Finally, therefore, we can compute the probability that ji = when k, j, i, and I are given as 
above and TTj has i — r fixed points: It comes to 



1 



{k-l)...{k-r) 



^ l<q<k ^ 



< 



1 {l-l)...{l-r){l-r-l) 
I {k-l)...{k-r){l-r) 

1 , {l-l)...{l-r) 



I (fc- l)...(/c-r) ■ 
Since r > 2, we obtain the desired upper bound 

M 1 (^-l)G-2) 1 

This implies the desired lower bound 0,{n^) on the total multiplication time. We can, for example, 
sum over r2(n^) values {k,j,i,l) with I < i < < I < < j < jii < k < n; in each of these 
cases a multiplication will require 0(n) steps with probability at least 1 — {l/l + t^ /k'^) > 1/2 when 
n > 72. 

Since the average running time is O(n^), there must exist, for all n, a sequence of perms 
7r2, ■ ■ ■ , 7r„ that make the algorithm do r2(n^) operations. But it appears to be difficult to define 
such perms via an explicit construction. Nor is there an obvious way to prove the Q(n^) bound 
when the iterative implementation of step A2 is adopted in place of the recursive implementation, 
even in the totally random case. 

8. More meaningful upper bounds. The examples studied above show that it is misleading to 
characterize algorithms A and B by merely saying that they will process m perms of i7(n) with a 
worst-case running time of 0(ra^+mn^). In one sense this estimate is sharp, because we've seen that 
$7(n^) behavior may indeed occur; but our other examples, together with extensive computational 
experience, show that the procedures often run considerably faster in practice. 

We can improve the estimate of Section 4 by introducing another parameter. Let g be the 
order of the group r{n) that is generated. Then we have the following result: 
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Theorem. A transversal system for a perm group of order g generated by m perms of II {n) 
can be found in at most 0(n^{logg)^ /logn) + O (^n^ {log g)^) + 0{mn\ogg) steps, using at most 
Oip?' \ogg/\ogn) + 0{n{[oggY) memory cells. 

Proof. Let s{k) and t{k) be defined as before. Then g = YYk=i *(^)) ^^'^ the number of membership 

tests is m + X^^^i (s(^)i(fc) — s{k) + l). Each membership test involves at most 0{logg) multipH- 
cations by non-identity perms, because the number of indices k with ,s(k) > 1 cannot exceed 9{g), 
the total number of prime factors of g. This accounts for the term 0{mn\ogg) in the theorem. 

Moreover, each t{k) is at most 9{g) = 0{\ogg), as wc have argued before. Therefore we can 
complete the proof of the time bound by showing that Yl^=i{^{^) ~ ^) — 0{n\ogg/\ogn). 

Given n and s, let us try to minimize the product nfc=i subject to the conditions 

n 

s = ^^(sfc — 1) and 1 < < A; . 

k=l 

U Sk-i > Sk, we can interchange Sfc-i ^ Sk without violating the conditions; hence we may assume 
that si < S2 < ■ ■ ■ < Sn- Furthermore, ifl < Sfc-i < Sfc < k, we can decrease the product by 
setting (sfc_i, Sfe) ^ {sk-i ~ 1; + 1). Hence the product is smallest when we have Sk = k for as 
many large k as possible: 

Sn = n, Sn-l=n-l, Sq+i = q+l, Sq = r, Sq-i = ■ ■ ■ = Si = 1. 

Here q and r are the unique integers such that 

(2) - s - 1 = {D -r and 1 < r < q < n. 

(We assume that < s < (2).) The minimum product is 

n! 

Pin, s) = r— . 

?! 

The actual product in the algorithm is g > P(n, Yl^=i (*(^) "-*-))' hence our proof will be complete 
if we can show that 

\ log n / 

But this is not difficult. If s > ^n^ we have q < n/\/2, hence log P(n,s) = 0(nlogn) and the 
result holds. At the other extreme, if < s < n, we have P{n, s) = s + 1 and again the result is 
trivial. Otherwise we note that n — q > ls/n\, hence 

P(n.s)>^>,l-'-i>{^)-'-'; 

the relation (s/n)logn = 0(logP(n, s)) follows immediately. 

The space required to store the transversal perms akj is Ylk=i k{s{k) — l) = 0(n^ log^/logn). 
The space required to store the strong generators can be reduced to ^ A; t{k) summed over those k 
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with s{k) > 1, for if s{k) = 1 we have T{k) = T{k — 1). This sum has 0{logg) terms, each of which 
is 0{n\ogg). So the proof of the theorem is complete. 

Inspection of this proof shows that the running time is actually bounded by a slightly smaller 
estimate than claimed, namely 

Oiri^lnigflog^g) + 0{n^ln{gf ) + 0{mnln{g)) , where ln{g) = min(n, 6'(5()) . 

The space bound is, similarly, 0{v?\og^g) + 0[nln{g)^) ■ And the examples in Sections 5 and 6 
above show that even this improved bound might be unduly pessimistic; sometimes a judicious 
relabeling of points will speed things up. 

The storage occupied by strong generators is usually less than the storage required for perms 
of the traversal system, but it can be greater. For example, when n is even and the generators are 
respectively 

[n — l,n] 
[n — 3,n — 2][n — 1, n] 

[1,2] ... [n-3,n-2][n-l,n] 

then g = 2"/^ and the nln{g)^ term dominates. 

The values of Inig) and log,^ g are often substantially smaller than n, in perm groups of compu- 
tational interest. For example, the Hall-Janko group J2 has 5 = 2^ • 3^ • 52 • 7 and n = 100 (see [6]); 
here 6{g) = 13 and log^^i « 2.9. The unitary group Uq{2), which has order g = 2^^ • 3^ • 5 • 7 • 11, 
is represented as a perm group on n = 672 points in the Cayley library (see [10]); in this case 
ln{g) = 24 and log^g 3.5. Some representative large examples are Conway's perfect group -0, 
for which g = 2"^"^ ■ 3^ ■ 5'^ ■ 7'^ ■ 11 ■ 13 ■ 23, n = 196560, and log„ g ^ 3.6; and Fischer's simple group 
F^4, for which 5 = 2^1 • 3^*^ • 5^ • 7^ • 11 • 13 • 17 • 23 • 29, n = 306936, and log„ g 4.4. (See [3].) 

9. Historical remarks and acknowledgments. The algorithm described above is a variant of 
a fundamental procedure sketched by Sims in 1967 [8], which he described more fully a few years 
later as part of a larger body of algorithms [9]. The principal difference between the method of [9] 
and the present method is that Sims essentially worked with sets of strong generators satisfying 
the condition T(l) C T(2) C • • • C r(n). Thus, for example, when a G ^{n) he would test the 
product err for all strong generators r; the present algorithm tests err for such a only with the 
perms r of T{n), namely the given generators tt. His example, in which the group generated by 
[1,2,4,5,7,3,6] and [2,4] [3,5] required the verification of 54 products err, requires the testing of 
only 40 products in the present scheme. On the other hand, his method for representing the ^{k) 
as words in the generators was considerably more economical in its use of storage space, and space 
was an extremely critical resource at the time. Moreover, his way of maintaining strong generators 
blended well with the other routines in his system, so it is not clear that he would have regarded 
the methods of the present paper as an improvement. 

Polynomial bounds on the worst-case running time were not obvious from this original work. 
Furst, Hopcroft, and Luks showed in 1980 [5] that a transversal system and a set of strong generators 
could be found in O(n^) steps. (In their method the transversal system and strong generators 
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were identical.) The author developed the present algorithm a year later, while preparing to 
write Volume 4 of The Art of Computer Programming and while advising Eric W. Hamilton, an 
undergraduate student who was working on a research project with Persi Diaconis [4]. The present 
method became more widely known after the author discussed it informally at a conference in 
Oberwolfach on November 6, 1981; several people, notably Clement Lam, suggested clarifications 
of the rough notes that were distributed at that time. Eventually Professor Babai was kind enough 
to suggest that the notes of 1981 be published now, instead of waiting until Volume 4 has been 
completed. Those notes are reproduced with slight improvements in Sections 1-4 of the present 
paper. The author is grateful to the referees and to Profs. Babai and Luks for several penetrating 
remarks that prompted the additional material in Sections 5-8. 

Improved methods have been discovered in the meantime, notably by Jerrum [7], who has 
reduced the worst-case storage requirement to order n^. Babai, Luks, and Seress [2] have developed 
a more complicated procedure whose worst case running time is only 0{n^~^'^). 

The word "perm," introduced experimentally in the author's Oberwolfach notes, does not seem 
to be winning any converts. (In fact, Pratt himself has forgotten that he once made this suggestion 
in conversation with the author.) However, the proposal to use the notation tt" for inverses, instead 
of the usual 7r~^, has significantly greater merit, and the author hopes to see it widely adopted 
in future years. The shorter notation is easier to write on a blackboard and easier to type on a 
keyboard. Moreover, the longer notation is redundant, just as is redundant; in fact, 
stands for a~ raised to the first power! Thus there is no conflict between the two conventions, and 
a gradual changeover should be possible. 
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